Estimating Bacterial Diversity for Ecological Studies: Methods, Metrics, and Assumptions
نویسندگان
چکیده
Methods to estimate microbial diversity have developed rapidly in an effort to understand the distribution and diversity of microorganisms in natural environments. For bacterial communities, the 16S rRNA gene is the phylogenetic marker gene of choice, but most studies select only a specific region of the 16S rRNA to estimate bacterial diversity. Whereas biases derived from from DNA extraction, primer choice and PCR amplification are well documented, we here address how the choice of variable region can influence a wide range of standard ecological metrics, such as species richness, phylogenetic diversity, β-diversity and rank-abundance distributions. We have used Illumina paired-end sequencing to estimate the bacterial diversity of 20 natural lakes across Switzerland derived from three trimmed variable 16S rRNA regions (V3, V4, V5). Species richness, phylogenetic diversity, community composition, β-diversity, and rank-abundance distributions differed significantly between 16S rRNA regions. Overall, patterns of diversity quantified by the V3 and V5 regions were more similar to one another than those assessed by the V4 region. Similar results were obtained when analyzing the datasets with different sequence similarity thresholds used during sequences clustering and when the same analysis was used on a reference dataset of sequences from the Greengenes database. In addition we also measured species richness from the same lake samples using ARISA Fingerprinting, but did not find a strong relationship between species richness estimated by Illumina and ARISA. We conclude that the selection of 16S rRNA region significantly influences the estimation of bacterial diversity and species distributions and that caution is warranted when comparing data from different variable regions as well as when using different sequencing techniques.
منابع مشابه
Detecting diversity: emerging methods to estimate species diversity.
Estimates of species richness and diversity are central to community and macroecology and are frequently used in conservation planning. Commonly used diversity metrics account for undetected species primarily by controlling for sampling effort. Yet the probability of detecting an individual can vary among species, observers, survey methods, and sites. We review emerging methods to estimate alph...
متن کاملEstimating bacterial diversity from clone libraries with flat rank abundance distributions.
There are a number of parametric and non-parametric methods for estimating diversity. However all such methods employ either the proportional abundance of the most abundant taxon in a sample or require that a specific taxon is sampled more than once. Consequently, the available methods for estimating diversity cannot be applied to samples consisting entirely of singletons, which might be charac...
متن کاملEstimating extinction with the fossil record
Many ecological and palaeontological studies focus on extinction. The fossil record is particularly important for studying long-term patterns in extinction: although analyses of extant phylogenies can estimate extinction rates (e.g. Alfaro et al. 2009) and even suggest mass extinctions (e.g. Crisp & Cook 2009), they cannot imply trilobites ever existed or that sphenodonts (now represented only ...
متن کاملA Comparison of the Sensitivity of the BayesC and Genomic Best Linear Unbiased Prediction(GBLUP) Methods of Estimating Genomic Breeding Values under Different Quantitative Trait Locus(QTL) Model Assumptions
The objective of this study was to compare the accuracy of estimating and predicting breeding values using two diverse approaches, GBLUP and BayesC, using simulated data under different quantitative trait locus(QTL) effect distributions. Data were simulated with three different distributions for the QTL effect which were uniform, normal and gamma (1.66, 0.4). The number of QTL was assumed to be...
متن کاملتورش روشهای آنالیز استاندارد در برآورد اثرات علیتی
Standard methods for estimating exposure effects in longitudinal studies will result in biased estimates of the exposure effect in the presence of time-dependent confounders affected by past exposure. In the present review article, we first described the assumptions required for estimating the causal effect in longitudinal studies and their structure regarding various types of exposure and ...
متن کامل